Speech recognition using temporally connected kernels in mixture density hidden Markov models

Author

Panu Somervuo

Abstract

A method is presented for speeding up HMM-based speech recognition systems in which the states are modeled by a large number of Gaussian kernels. The emission probabilities of the states are usually dominated by the Gaussians nearest to the input vector. The speedup is gained without degrading recognition accuracy by concentrating on these kernels in a reduced K-best-kernel search. In this work, the time information of the input is encoded in the connections between the kernels. The search for the dominating kernels is then performed along the kernel connections, which model the trajectories of speech in the feature space. In the experiments, speaker-dependent speech recognizers were trained for ten speakers. The number of distance computations between feature vectors and kernel mean vectors was reduced by 75% without increasing the average phoneme recognition error, which was 5.7% for the baseline system.
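The idea of searching for the dominating kernels along kernel connections can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not say how the connections are built, so the sketch assumes that kernel i is linked to kernel j whenever j was the nearest kernel one frame after i was the nearest in the training data. The function names `build_connections` and `k_best_connected`, the squared Euclidean distance, and the full search on the first frame are all assumptions made for illustration.

```python
import numpy as np

def build_connections(means, train_frames):
    """Assumed rule: link i -> j if j is nearest one frame after i was nearest."""
    # Squared Euclidean distance from every training frame to every kernel mean.
    dists = ((train_frames[:, None, :] - means[None, :, :]) ** 2).sum(axis=-1)
    nearest = np.argmin(dists, axis=1)
    # Every kernel is connected to itself so the search can stay in place.
    conns = {i: {i} for i in range(len(means))}
    for a, b in zip(nearest[:-1], nearest[1:]):
        conns[int(a)].add(int(b))
    return conns

def k_best_connected(means, frames, conns, k):
    """For each frame, return (indices of the k best kernels, distances computed)."""
    candidates = np.arange(len(means))  # first frame: search all kernels
    results = []
    for x in frames:
        d = ((means[candidates] - x) ** 2).sum(axis=1)
        best = candidates[np.argsort(d)[:k]]
        results.append((best, len(candidates)))
        # Next frame: only kernels connected to the current k best are examined.
        linked = set().union(*(conns[int(i)] for i in best))
        candidates = np.array(sorted(linked))
    return results
```

In this sketch the saving comes from the second frame onward, where only the kernels linked to the previous K best are compared with the input vector instead of the whole codebook. How much is saved depends on K and on how densely the kernels are connected.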


Similar resources

Speaker Independent Speech Recognition Using Hidden Markov Models for Persian Isolated Words

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

Speech Recognition Using Monophone and Triphone Based Continuous Density Hidden Markov Models

Speech recognition is the process of transcribing speech to text. Phoneme-based modeling is used, wherein each phoneme is represented by a continuous density hidden Markov model. Mel frequency cepstral coefficients (MFCC) are extracted from the speech signal; delta and double-delta features representing the temporal rate of change of the features are added, which considerably improves the recognition accura...

Training Augmented Models Using SVMs

There has been significant interest in developing new forms of acoustic model, in particular models which allow additional dependencies to be represented than those contained within a standard hidden Markov model (HMM). This paper discusses one such class of models, augmented statistical models. Here, a local exponential approximation is made about some point on a base model. This allows additi...

Publication date: 2000
